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language data, CTS researchers are able to draw 
conclusions about translation that are grounded in 
empirical evidence rather than intuition or 
speculation. 

Data-driven approach is a key feature of 
corpus-based translation studies (CTS), which is a 
subfield of translation studies that focuses on the 
use of language corpora to better understand the 
nature of translation and the processes involved in 
it. In this article, we will explore the importance of 
empirical research and data-driven approach in 
CTS and some of the methods that are commonly 
used. 
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MAQOLA HAQIDA 
Kalit soʻzlar: Empirik, korpus, Annotatsiya: Empirik tadqiqotlar korpus 
tarjimashunoslik, tarjima, tadqiqot. asosidagi tarjimashunoslikning (CTS) muhim 
jihati boʻlib, tarjimashunoslikning kichik sohasi 
boʻlib, tarjimaning mohiyatini va unda ishtirok 
etayotgan jarayonlarni yaxshiroq tushunish 
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uchun til korpusidan foydalanishga qaratilgan. 
Haqiqiy til ma'lumotlarini tahlil qilish orqali CTS 
tadqiqotchilari tarjima haqida sezgi yoki 
taxminlarga emas, balki empirik dalillarga 
asoslangan xulosalar chiqarishlari mumkin. 
Ma'lumotlarga asoslangan yondashuv - 
korpusga asoslangan tarjimashunoslikning (CTS) 
asosiy xususiyati bo'lib, u tarjimaning mohiyatini 
va unga aloqador jarayonlarni yaxshiroq 
tushunish uchun til korpusidan foydalanishga 
qaratilgan tarjimashunoslikning kichik sohasi 
hisoblanadi. Ushbu maqolada biz CTSda empirik 
tadqiqotlar va ma'lumotlarga asoslangan 
yondashuvning ahamiyatini va tez-tez 
qo'llaniladigan ba'zi usullarni o'rganamiz. 
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О СТАТЬЕ 
Ключевые слова: Эмпирический, Аннотация: Эмпирические исследования 
корпус, переводоведение, перевод, являются важным аспектом переводоведения 
исследование. на основе корпусов (CTS), которое 


представляет собой подполе переводоведения, 
фокусирующееся на использовании языковых 
корпусов для лучшего понимания природы 
перевода и связанных с ним процессов. 
Анализируя аутентичные языковые данные, 
исследователи CTS могут делать выводы о 
переводе, основанные на эмпирических 
данных, а не на интуиции или 
предположениях. 

Подход, основанный на данных, является 
ключевой особенностью переводоведения на 
основе корпусов (CTS), которое представляет 
собой подполе переводоведения, которое 
фокусируется на использовании языковых 
корпусов для лучшего понимания природы 
перевода и процессов, связанных с ним. В этой 
статье мы рассмотрим важность 
эмпирических исследований и подхода, 
основанного на данных, в CTS, а также 
некоторых широко используемых методов. 


INTRODUCTION 
Corpus-based translation studies (CTS) is a rapidly growing field that is primarily concerned 


with the study of translation using corpora, or large collections of language data. Empirical research 
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and a data-driven approach are essential to CTS because they provide a systematic, objective and 
verifiable method for investigating translation phenomena. 
THE MAIN RESULTS AND FINDINGS 

Empirical research involves the collection and analysis of data through observation and 
experimentation. In CTS, empirical research is typically carried out using corpora, which are large 
collections of texts that have been electronically stored and can be searched and analyzed using 
specialized software. These corpora can be used to study a wide range of translation-related 
phenomena, such as translation strategies, stylistic differences between translations, and the impact 
of cultural factors on translation. 

One of the major advantages of empirical research in CTS 1$ that it allows researchers to identify 
patterns and regularities in translation data that might not be immediately apparent through intuitive 
or subjective analysis. For example, by analyzing a large corpus of translated texts, researchers can 
identify common translation strategies used by translators, such as the use of loanwords, calques, or 
paraphrasing. These patterns can then be used to develop more effective translation methodologies 
and tools. 

A data-driven approach is also central to CTS because it allows researchers to use statistical 
methods to analyze large amounts of data and draw meaningful conclusions. Statistical analysis can 
be used to identify patterns in the data, such as differences in vocabulary use between source and 
target texts, or the frequency of certain translation strategies. Statistical models can also be used to 
predict translation outcomes, such as the likelihood of a particular word or phrase being translated in 
a certain way. 

In addition to providing a more objective and systematic approach to translation research, 
empirical research and a data-driven approach also have the potential to improve the quality of 
translations. By identifying patterns in translation data, researchers can develop more accurate 
translation models and tools that can be used to improve the quality and consistency of translations. 

One of the main advantages of empirical research is that it allows researchers to test hypotheses 
and make predictions about translation based on concrete data. This is in contrast to more traditional 
approaches to translation studies, which often rely on anecdotal evidence or theoretical speculation. 
Empirical research provides a more rigorous and systematic way of studying translation, which can 
lead to more reliable and accurate conclusions. 

Empirical research in CTS typically involves the use of large language corpora that contain a 
wide range of different types of texts and translations. These corpora are annotated with metadata and 
other information that allows researchers to identify patterns and regularities in translation. For 
example, a corpus might contain parallel texts in two different languages, such as the English and 
French versions of a novel. By analyzing the translations in the corpus, researchers can identify 
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common translation strategies, such as the use of loanwords or the translation of idiomatic 
expressions. 

One of the most important tools used in empirical research in CTS is the concordance. A 
concordance is a listing of all the occurrences of a particular word or phrase in a corpus, along with 
the surrounding context. By examining concordances, researchers can identify patterns in the way 
that words and phrases are translated. For example, a concordance might reveal that a particular word 
is usually translated in a certain way in a given context, or that certain types of texts tend to use certain 
translation strategies more frequently than others. 

Another important method used in empirical research in CTS is statistical analysis. By applying 
statistical techniques to the data in a corpus, researchers can identify patterns and trends that might 
not be immediately apparent from a qualitative analysis of the data. For example, statistical analysis 
might reveal that certain translation strategies are more common in certain types of texts or that certain 
translators tend to use certain strategies more frequently than others. 

The data-driven approach in CTS involves the use of language corpora to identify patterns and 
regularities in translation. By analyzing large amounts of authentic language data, researchers are 
able to gain insights into the nature of translation that would be impossible to obtain through intuition 
or theoretical speculation. 

One of the key advantages of the data-driven approach is that it allows researchers to identify 
the most common translation strategies and linguistic features used in different types of translations. 
For example, a corpus of legal translations might reveal that certain legal terms are consistently 
translated in a certain way, while a corpus of literary translations might reveal that certain types of 
metaphors are more likely to be translated literally or figuratively. 

The data-driven approach in CTS also allows researchers to compare and contrast translations 
across languages. By analyzing parallel texts in two or more languages, researchers can identify cross- 
linguistic differences in translation strategies and techniques. For example, a corpus of English- 
French translations might reveal that certain words or phrases are more likely to be translated 
differently in the two languages. 

One of the methods commonly used in the data-driven approach in CTS is corpus annotation. 
Corpus annotation involves the manual or automatic tagging of language data with metadata that 
allows researchers to identify patterns and regularities. For example, a corpus of legal translations 
might be annotated with information about the source and target languages, the type of legal document 
being translated, and the translation strategy used. 

Another method commonly used in the data-driven approach in CTS is concordancing. 


Concordancing involves the extraction of all occurrences of a particular word or phrase in a corpus, 
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along with the surrounding context. By examining these concordances, researchers can identify 
patterns in the way that words and phrases are translated. 

The data-driven approach in CTS is a powerful tool for understanding the nature of translation 
and the processes involved in it. By analyzing large amounts of authentic language data, researchers 
are able to identify patterns and regularities that would be impossible to obtain through intuition or 
theoretical speculation. This approach has led to many important insights into translation, and it is 
likely to continue to be a central focus of research in CTS for many years to come. 

CONCLUSION 

In conclusion, empirical research and a data-driven approach are essential to CTS because they 
provide a systematic, objective and verifiable method for investigating translation phenomena. By 
using corpora and statistical analysis, researchers can identify patterns in translation data that can be 
used to improve translation methodologies and tools, and ultimately improve the quality and 
consistency of translations. 
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